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METHOD FOR STABILIZING HETEROLOGOUS PROTEIN 
EXPRESSION AND VECTORS FOR USE THEREIN 



Technical Field of the Invention 

The present invention relates generally to the 
field of biotechnology. More particularly, the invention 
relates to the fields of protein expression and recombi- 
nant DNA technology to improve the yield of poorly ex- 
pressed mammalian polypeptides in bacterial hosts. 



Background of the Invention 

Many eukaryotic proteins are not capable of be- 
ing expressed in Escherichia coli in any measurable yield, 
or even if detectable, are not capable of being expressed' 
at such commercially recoverable levels due to proteolysis 
of the foreign protein by the host. Small proteins (e.g., 
peptide hormones of less than 100 amino acids) appear to ' 
25 be especially sensitive to degradation. The degree of 
proteolysis varies from host to host and protein to 
protein. Possibly the highest level of expression of a 
eukaryotic protein in E. coli has been observed with gamma 
interferon, which was expressed at approximately 60% of 
30 total cellular protein. The high level of expression of a 
few eukaryotic proteins has been achieved because they 
reach a concentration in the cell where they can aggregate 
into insoluble masses called inclusion or refractile bod- 
ies (e.g., bovine growth hormone; Schoner et al (1985), 
35 Biotechnology 3:151-154). In this form, the eukaryotic 
protein is less susceptible to proteolysis. 
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Proteins which do not become insoluble on their 
own do in some cases form inclusion bodies if joined to 
another protein such as a procaryotic protein. A small 
number of prokaryotic proteins have been used in this man- 
ner: E. coli lac Z/ trpE, and recA genes and the lambda 
cll gene, for example. 

Chloramphenicol acetyl transferase (CAT) has been 
used as a selectable marker (resistance to 
chloramphenicol), as an easily assayed enzyme to monitor 
the efficiency of both eukaryotic and prokaryotic expres- 
sion from different promoters (Delegeane, A.M. , et al- 
(1987) Mol cell Biol 7:3994-4002), regulatory sequences, 
and/or ribosome binding sites, and for gene fusions which 
join sequences encoding a eukaryotic protein to the 
nucleotide sequence encoding mature, native CAT (Buckley 
and Hayashi (1986) Mol Gen Germt-. ? nd,nn_,oc. European 
Patent Publication 161,937, published 21 November 1985) or 
to the carboxy terminal fragment of CAT (usually retaining 
CAT activity) . * 

While the literature establishes that fusion 
proteins are useful to express heterologous proteins in. 
bacteria and that the native CAT gene sequence has been 
used for such a purpose, efforts to use a truncated form 
of CAT to express or to increase the recoverable yield of 
heterologous, mammalian proteins such as amyloid protein 
A4-751 insert sequence, glucagon-like peptide I, 
adipsin/D, and lung surfactant SP-B and SP-C, have not 
been reported. In light of the fact that many important 
proteins cannot be successfully expressed in bacteria in 
any commercially recoverable yield, there is a need to 
develop systems for the bacterial expression and recovery 
of such proteins. 
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Disclosure of the Invention 

One aspect of the invention concerns a method of 
stabilizing heterologous protein expression in a 
prokaryotic host comprising: 
5 (a) constructing a hybrid gene comprising in 

sequential order, a 3' truncated chloramphenicol 
acetyl transferase (CAT) gene sequence fused in frame with 
a heterologous gene sequence encoding a mammalian 
polypeptide selected from the group consisting of amyloid 

10 protein A4-751 insert sequence, glucagon-like peptide I, 
adipsin/D, lung surfactant protein SP-B and lung 
surfactant protein SP-C; wherein said polypeptide is 
normally not recoverable in bacterial expression systems, 
and wherein said hybrid gene, upon translation, produces a 

15 fusion protein in a recoverable yield; 

(b) providing a vector fox expression of said 
hybrid gene; 

(c) culturing the prokaryotic host transformed 
with the expression vector; and 

20 (d) recovering the fusion protein. 

A second aspect of the invention concerns a 
bacterial expression vector capable of enhancing the level 
of expression of non-stable, bacterially produced 
heterologous polypeptides comprising a hybrid gene having, 

25 in sequential order, a 3' CAT truncated gene sequence 

fused in frame to a heterologous gene sequence encoding a 
mammalian polypeptide selected from the group consisting 
of amyloid protein A4-751 insert sequence, glucagon-like 
peptide I, adipsin/D, lung surfactant protein SP-B and 

30 lung surfactant protein SP-C, wherein said polypeptide is 
normally not recoverable in bacterial expression systems; 
whereby said truncated CAT gene sequence is capable of 
rendering the resulting fusion protein resistant to 
proteolyt ic degr adat ion . 

35 A preferred embodiment for both the method and 

vector of the present invention employs a CAT coding 



WO 90/01540 



PCT/US89/03417 



_4_ 

sequence of less than, or equal to 180 amino acids, 
preferably between 73 and 180 amino acids. Although the 
resulting CAT protein is substantially reduced as compared 
to the native CAT protein, surprisingly, it has been found 
5 that the truncated CAT protein substantially contributes 
to the stability of the expressed protein and therefore, 
permits recovery of an increased yield of the desired 
heterologous protein. 

Yet another aspect of the invention provides an 

10 improved bacterial expression vector capable of enhancing 
the level of expression of non-stable, bacterially 
produced heterologous polypeptides wherein said vector 
contains a hybrid gene having in sequential order, a 
modified 3' truncated CAT gene sequence linked to a 

15 heterologous gene sequence. The improvement comprises 

altering one or more DNA codons of the truncated CAT gene 
to eliminate potential chemical cleavage sites within the 
CAT protein. 

Other aspects of the invention will be readily 

20 apparent to those of skill in the art from the description 
and examples which follow. 

Brief Description of the Drawings 

Figure 1 sets forth the amino acid and cor- 

25 responding nucleotide sequences for a 241 amino acid (aa) 
CAT-hANP hybrid protein containing an endoproteinase Glu-C 
proteolytic cleavage site. The amino terminal portion of 
this hybrid protein encodes the first 210 amino acids of 
CAT, which sequence is extensively referred to throughout 

30 the present invention. 

Figure 2 illustrates a series of vectors and 
synthetic fragments used for cloning and expression of the 
CAT-hANF hybrid proteins 6f the invention. Figure 2A 
depicts an EcoRI-PstI synthetic fragment containing the 

35 coli trp promoter-operator sequence, a ribosomal binding 
site, and downstream cloning sites. Figure 2B is a 
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restriction site and function map of plasmid pTrp233. 
Figure 2C is a restriction site and function map of 
plasmid pCAT21. Figure 2D is an EcoRI-Hindlll synthetic 
fragment encoding the hANP (102-126) gene preceded by an 
5 endoproteinase Glu-C cleavage site. Figures 2E through G 
are restriction site and function maps of plasmids phNF75, 
pChNF109, and pChNF121, respectively. Figure 2H depicts a 
synthetic 1-73 aa CAT gene sequence contained within Ndel- 
Hindlll fragment. Figure 21 is a restriction site and 

10 function map of plasmid pChNF142 wherein site-specific 

mutagenesis was used to substitute Tyr and Ser codons for 
residues 16 and 31, respectively, of the CAT gene. 

Figure 3 illustrates two different preparative 
SDS-polyacrylamide gels. Figure 3A is an SDS- 

15 polyacrylamide gel of the CAT-A4-75H hybrid protein. 

Lane 1 = molecular size standards; Lane 2 = induced W3110 
(pCAPil32); Lane 3 = induced W3110 (pTrp83) vector 
control; Lane 4 = uninduced W3110 (pCAPil36); and Lane 5 = 
induced W3110 (pCAPil36). Figure 3B is an SDS- 

20 polyacrylamide gel of the CAT-GLP-I hybrid protein. Lane 
1 = molecular size standard; Lane 2 = uninduced W3110 
(pCGLP139); Lane 3 = induced W3110 (pCGLP139); and Lane 4 
= induced W3110 (pTrp83) vector control. 

Figure 4 illustrates the amino acid and cor- 

25 responding nucleotide sequences for a CAT-A4-7511 hybrid 
protein and a CAT-GLP-I hybrid protein of the invention. 
Figure 4 A depicts the first 73 codons encoding the amino 
terminus of the CAT protein joined in-frame to the 
synthetic A4-751i gene preceded by a chemical cleavage and 

30 site encoded by Asn-Gly. Figure 4B depicts the first 73 
codons encoding the amino terminus of the CAT protein 
joined in-frame to the synthetic GLP-1 gene preceded by a 
Met codon. 

Figure 5 illustrates two plasmids, pCAT73 and 
35 pCAT210, in which the gene for tetracycline resistance is 
restored in these CAT expression vectors. 
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Figure 6 is the nucleotide sequence and cor- 
responding amino acid sequence of the SP-B expression 
construct pC210SP-B from the EcoR I site preceding the trp 
promoter region through the Hind i I I site containing the 
5 translation stop codon. The CAT, linker, and SP-B regions 
are identified therein, respectively, by the arrows. 

Figure 7 is a preparative SDS-polyacryiamide gel 
of the CAT: SP-B fusion protein. Lane A = molecular size 
standards; Lane B = induced W3110 cells containing pTrp233 
10 vector control; and Lane C = induced pC210SP-B/W3110 
cells . 

Figure 8 illustrates the nucleotide sequence and . 
corresponding amino acid sequence of the 251 residue 
CAT:SP-C fusion protein from plasmid pC210SP-C. The CAT 
15 gene, linker sequence and SP-B gene are sequentially 
identified therein by the arrows. 

Figure 9 provides the molecular weight 
determinations for each of the CAT:SP-C fusion proteins. 
Lane A = molecular size standards; Lane B = induced W3110 
20 cells containing pTrp233 vector control; Lane C = induced 
pC106SP-C; Lane D = pC149SP-C; Lane E = pC179SP-C; and 
Lane F = pC210SP-C. 

Figure 10 provides the cDNA and amino acid 
sequences for human adipsin/D. 

25 

Modes for Carrying Out the Invention 

A. Definitions 

As used herein the term "stabilizing protein 
30 expression" refers to a property of a fusion protein 
responsible for inhibiting proteolysis of a foreign 
protein by a recombinant host cell. 

"Insoluble" as referred to proteins intends a 
condition wherein a protein may be recovered only by 
35 extraction with detergents or chaotropic agents. Usually, 
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insoluble proteins are formed as a consequence of 
intracellular aggregation of the cloned gene products . 

"High protein expression" or "enhanced protein 
expression" refers to a level of expression wherein the 
5 fused protein can comprise 10% or more of the total 

protein produced by each cell. A preferred range for high 
protein expression levels is from 10-20% of total cell 
protein . 

As used herein, "non- recover able" refers to a 
10 level of expression wherein the desired protein may be 
detected using sensitive techniques, e.g., Western blot 
analysis, yet the protein is not commercially recoverable 
using conventional purification techniques such as SDS- 
polyacrylamide gel electrophoresis, gel filtration, ion 
15 exchange chromatography, hydrophobic chromatography, af- 
finity chromatography, or isoelectric focusing. 

"Mammalian" refers to any mammalian species, and 
includes rabbits, mice, dogs, cats, primates and humans, 
preferably humans . 
20 As used herein, the term "heterologous" proteins 

refers to proteins which are foreign to the host cell 
transformed to produce them. Thus, the host cell does not 
generally produce such proteins on its own. 

25 B. CAT Fusions 

CAT encodes a 219 amino acid mature protein and 
the gene contains a number of convenient restriction 
endonuclease sites (5 '- Pvu. il, EcoRI, Ddel, Nco l, and Sca l- 
3 ' ) throughout its length to test gene fusions for high 

30 level expression. These restriction sites may be used for 
ease of convenience in constructing the hybrid gene 
sequences of the invention or other sites within the gene 
sequence may be generated using techniques commonly known 
to those of skill in the art. Any of the resulting CAT 

35 sequences are considered useful so long as the resulting 
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CAT fusion retains the ability to enhance the expression 
of the desired heterologous peptide. 

The expression constructs of the invention can 
employ most of the CAT-encoding gene sequence or a 
5 substantially truncated portion of the sequence encoding 
an N- terminal portion of the CAT protein linked to the 
gene encoding the desired heterologous polypeptide. 'In 
one embodiment of the invention, the CAT portion of the 
fusion codes for about the N-terminal one-third of the CAT 
10 sequence . 

The expression constructs exemplified herein, 
which demonstrated enhanced levels of expression for a 
variety of heterologous proteins, utilize a number of 
varying lengths of the CAT protein ranging in size from 73 

15 to 210 amino acids. The 73 amino acid CAT fusion 

component is conveniently formed by digesting the CAT 
nucleotide sequence at the EcoRI restriction site. 
Similarly, the 210 amino acid CAT fusion component is 
formed by digesting the CAT nucleotide sequence with Seal . 

20 These, as well as other CAT restriction fragments, may 
then be ligated to any nucleotide sequence encoding a 
desired protein to enhance expression of the desired 
protein. 

Significantly, although the expression level of 
25 fusion protein (approximately 15-20% of total cell 

protein) was similar for the CAT (106 amino acid) - SP-C 
fusion and the CAT (210 amino acid) - SP-C fusion, it can 
be seen that the former case actually represents a 
significant increase in expression level for the desired 
30 SP-C polypeptide, since the SP-C polypeptide constitutes a 
substantially larger proportion of the total fusion 
protein in the former case. The ability to increase 
expression level for the desired polypeptide by reducing 
the size of the fused CAT protein sequence was quite an 
35 - unexpected finding in view of the experience of the prior 
art. In general, the prior art experience has been that 
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reduction in size of the bacterial leader sequence does 
not result in increased production of the fused 
heterologous polypeptide due to a concomitant larger 
reduction in the expression level of the fusion protein. 
5 With one exception, the various CAT-heterologous 

fusion proteins exemplified herein were found to be 
expressed in the range of approximately 10-20% of the 
total cell protein. Thus, the versatility of the CAT fu- 
sions, that is, the ability to use a variety of CAT coding 

10 sequences having the ability to enhance the expression of 
a desired protein, allows great flexibility of choice- when 
constructing CAT hybrid genes. 

The reading frame for translating the nucleotide 
sequence into a protein begins with a portion of the amino 

15 terminus of CAT, the length of which varies, continuing 
in-frame with or without a linker sequence into the 
protein to be expressed, and terminating at the carboxy 
terminus of the protein. An enzymatic or chemical cleav- 
age site may be introduced downstream of the CAT sequence 

20 to permit recovery of the cleaved product from the hybrid 
protein. Such cleavage sequences are known in the art as 
are the conditions under which cleavage can be effected. 
Following cleavage, the desired heterologous polypeptide 
can be recovered using known techniques of protein 

25 purification. Suitable cleavage sequences include, 

without limitation, cleavage following methionine residues 
(cyanogen bromide), glutamic acid residues (endoproteinase 
Glu-C), tryptophan residues (N-chlorosuccinimide with urea 
or with sodium dodecyl sulfate (SDS)) and cleavage between 

30 asparagine and lysine residues ( hydro xyl amine ) . 

To avoid internal cleavage within the CAT 
sequence, amino acid substitutions can be made using 
conventional site specific mutagenesis techniques (Zoller, 
M.J., and Smith, M. (1982), Nuc Acids Res 10 t 64 87-6500, 

35 and Adelman, J. P., et al (1983), DMA 2 t 183-193 ) . This is 
conducted using a synthetic oligonucleotide primer com- 
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plementary to a single-stranded phage DNA to be 
mutagenized except for limited mismatching, representing 
the desired mutation. Of course, these substitutions 
would only be performed when expression of CAT is not 
5 significantly affected. Where there is only one internal 
cysteine residue, as in the short CAT sequence, this 
residue may be replaced to help reduce multimerization 
through disulfide bridges. 

10 C. CAT Fusion Vectors 



CAT fusion sequence; procaryotic hosts are, of course, the 
most convenient for cloning procedures. Procaryotes most 
frequently are represented by various strains of E. coli ; 

15 however, other microbial strains may also be used. 

Plasmid vectors which contain replication sites, select- 
able markers and control sequences derived from a species 
compatible with the host are used; for example, E . coll is 
typically transformed using derivatives of pBR322, a 

20 plasmid derived from an E. coli species by Bolivar et al, 
Gene 2 :95 (1977). pBR322 contains genes for ampicillin 
and tetracycline resistance, and thus provides multiple 
selectable markers which can be either retained or 
destroyed in constructing the desired vector. 

25 In addition to the modifications described above 

which would facilitate cleavage and purification of the 
product polypeptide, the gene conferring tetracycline 
resistance may be restored to the exemplified CAT fusion 
vectors for an alternative method of plasmid selection and 

3 o maintenance . 



operator sequences have been exemplified in the present 
CAT vectors, different control sequences can be 
substituted for the trp regulatory sequences and are 
35. considered to be within the scope of the invention. Com- 
monly used procaryotic control sequences which are defined 



Procaryotic systems may be used to express the 



Although the E. coli tryptophan promoter- 
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hereln to include promoters for transcription initiation, 
optionally with an operator, along with ribosome binding 
site sequence, include such commonly used promoters as the 
beta-lactamase (penicillinase) and lactose (lac) promoter 
5 systems (Chang et al, Nature 198 ; 1056), the lambda-derived 
p_ promoter (Shimatake et al, Nature 292 ; 128 (1981)) and 
N-gene ribosome binding site, and the trp-lac (trc) 
promoter system (Amann and Brosius, Gene 40 ; 183 (1985)). 

Since the general utility of these CAT vectors 

10 have been established with very different mammalian 

peptides (ranging in protein size, the presence or absence 
of disulfide bonds, and being hydrophobic or hydrophilic 
in nature) vectors with unique restriction sites may be 
created or substituted for the pBR322-derived vector il- 

15 lustrated in the examples. 

D. Heterologous Protein Expression 

Amino terminal DNA sequences of CAT have been 
fused to DNA sequences encoding human polypeptides for 

20 high level expression in the bacterial host E. coli . The 
polypeptides described herein are relatively small mam- 
malian polypeptides ranging in size from about 30 to 76 
amino acid residues. Attempts to directly express, e.g., 
in a non-fused form, each of these polypeptides in 

25 bacteria have been unsuccessful, most likely due to the ■ 
proteolytic degradation which occurs upon translation of 
the raRNA product. In the case of extremely hydrophobic 
polypeptides, even attempts to express such polypeptides 
using beta-galactosidase fusions produced detectable but 

30 very low level amounts of protein. 

Examples of polypeptides that have been success- 
fully expressed to high level in bacteria using the 
truncated CAT fusions include a variety of mammalian 
polypeptides including amyloid protein A4-751 insert 

35 sequence, glucagon-like peptide I, adipsin/D, lung 

surfactant protein SP5 (SP-C), and lung surfactant SP18 
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(SP-B) . Preferably, the mammalian, protein is of human 
origin, although other sources are also contemplated to be 
within the scope of this invention. A4-751 is a 57 amino 
acid sequence identified within the precursor for the A4 
5 amyloid protein associated with Alzheimer's disease and 
shares homology with the Kunitz family of serine 
proteinase inhibitors (Ponte, P., et al (1988) Nature 
331:525-527; Tanzi, R.E., et al (1988) Nature 331 :528- 
530). Glucagon-like peptide I (GLP-I, 7-31) is a 31 amino 

10 acid hormone co-encoded in the glucagon gene which is a 
potent stimulator of insulin release (Mojsov, S., et al 
(1987) J Clin Inves 79 i 6 16-619). Adipsin/D is a serine 
protease synthesized in and secreted from adipocytes 
(Zusalak, K.M. , et al (1985) J Mol Cell Biol 5 :419). Lung 

15 surfactant SP-B is a 76 amino acid hydrophobic protein. 
Lung surfactant SP-C is a 35 amino acid hydrophobic 
protein. Both SP-B and SP-C greatly enhance spreading of 
surfactant phospholipids at an air: water interface. 

20 E. Hosts Exemplified 

Host strains used in cloning and procaryotic ■ 
expression herein are as follows: 

For cloning and sequencing, and for expression 
of construction under control of most bacterial promoters, 
25 E. coli strains such as MC1061, DH1, RR1, W3110, MM294, B, 
C600hfl, K803, HB101, JA221, and JM101 may be used. 

F. General Methods 

Recombinant DNA methods are described in 

30 Maniatis et al (1982), Molecular Cloning, Cold Spring 

Harbor Laboratory, Cold Spring Harbor, New York, when not 
specifically cited in the following examples. Methods are 
also described in the literature for visualizing inclusion 
bodies, isolating them from cells, then solubilizing, 

35 purifying, and cleaving the hybrid protein (e.g., Itakura, 
K. , et al (1977) Science 198a 1056-1063 ; Shine, J., et al 
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(1980) Nature 285 t 455-461) . Methods are also available, 
if necessary, for refolding the protein product 
(Creighton, T.E., Proceedings of Genex-UCLA Symposium, 
1985, Kingstones (in press). The teachings of all of 
5 these references are incorporated herein by reference. 

Examples 

I. Expression of Chloramphenicol Acetyl trans fe rase-Human 
10 Atrial Natriuretic Peptide Hybrid Proteins in Es cherichia 
coli. 

A. Expression vector pChNF109 . 

Expression vector pChNF109 encodes a 241 amino 

15 acid CAT-hANP hybrid protein containing an endoproteinase 
Glu-C proteolytic cleavage site (Fig. 1). Most of the CAT 
gene (amino acids 1-210) has been joined in-frame to the 
hANP ( 102-126 ) gene and cleavage site (26 amino acids) 
through a linker sequence (5 amino acids). The hANP 

20 polypeptide comprises about 10% of the hybrid protein. 

This vector was constructed from plasmids pTrp233, pCAT21, 
and phNF75 which supplied the plasmid backbone and trp 
promoter-operator, the CAT gene, and the hANP ( 102-126) 
gene and cleavage site, respectively. 

25 

1. Construction of pChNF109 . 

Plasmid pTrp233 was constructed by insertion of 
a synthetic EcoR I-PstI fragment containing the E . coll trp 
promoter-operator sequence, a ribosomal binding site, and 

30 downstream cloning sites into plasmid pKK233-2-NdeI which 
contains strong transcription termination signals, T1T2, 
and the beta-lactamase gene. The synthetic fragment (see 
Fig. 2A) was assembled using the method of Vlasuk et al 
(1986), J. Biol Chem 261 ; 4789-4796 and its sequence 

35 confirmed by the method of Sanger et al (1977), Proc Natl 
Acad Sci USA 74; 5463-5467 in M13mp8 and Ml3mp9 . Plasmid 
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pKK233-2-NdeI (disclosed in co-pending U.S. Serial No. 
766,030, filed 8 May 1985 and incorporated herein by 
reference) was digested with EcoR I and Pstl, its termini 
dephosphorylated using calf intestinal phosphatase, and 
5 ligated with the synthetic EcoR I - Pst I fragment. Plasmid 
pTrp233 was isolated (Fig. 2B) from E. coli JA221 
transformed to ampicillin resistance. 

Plasmid pCAT21 was constructed by insertion of 
the CAT gene (from transposon Tn9, Alton and Vapnek, 

10 (1979) Nature 282 : 864-869 ) into plasmid pTrp233 under the 
control of the trp promoter-operator. Plasmid pAI»13ATCAT 
(a plasmid disclosed in co-pending U.S. Serial No. 
095,742, filed 11 September 1987 and incorporated herein 
by reference) was digested with Ndel and Hindlll and the 

15 approximately 750 bp Ndel-Hindlll fragment containing the 
CAT gene (with the initiating Met residue encoded at the 
Nde l site) was purified using agarose gel electrophoresis. 
The CAT gene was ligated with Nde l and Hindlll-digested 
pTrp233 using T4 DNA ligase. From E^ coli MC1061 

20 (Casadaban et al (1980), I Mol Biol 138 : 179-209) 

ampicillin- resistant trans formants, plasmid pCAT21 was 
isolated (Fig. 2C) . 

Plasmid phNF75 was constructed by insertion of a 
synthetic hANP gene preceded by a proteolytic cleavage 

25 site into plasmid pBgal (Shine et al (1980), Nature 

285:456). Eight oligodeoxyribonucleotides (Fig. 2D) were 
assembled into a synthetic hANP (102-126) gene preceded by 
an endoproteinase Glu-C cleavage site (method of Vlasuk et 
al (1986), supra). The synthetic DNA fragment (with a 5' 

30 EcoRI tail and a 3' blunt end) was ligated with Eco RI and 
Sma l restriction endonuclease digested M13mpl9 using T4 
DNA ligase for the purpose of DNA sequencing (method of 
Sanger et al (1977), supra ) . A clone with the correct 
sequence, M13-hNF7, was digested with BamH I and Bglll, the 

35 fragment containing the hANP gene purified by agarose gel 
electrophoresis, and the fragment ligated with BamH I- 
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digested and bacterial alkaline phosphatase 
dephosphorylated pTrp233 using T4 DNA ligase. A plasraid 
with the insert in the orientation which gives adjacent 
Hin di II, Bam HI and Eco RI sites at the 3' end of the hANP 
5 gene, phNF73, was identified by the size of the fragments 
generated by digestion with Hindi I I and PvuII. Plasmid 
phNF73 was digested with Eco RI , the hANP gene purified 
using polyacryl amide gel electrophoresis, and the gene 
ligated with EcoRI -digested and bacterial alkaline 
10 phosphatase dephosphorylated plasmid pBgal. From E. coli 
MC106I ampicillin-resistant trans formants r plasmid phNF75 
(Fig. 2E) was identified by the size of the DNA fragments 
generated by digestion with PstI and Hind i II. 

Expression vector pChNF109 was constructed by 
15 insertion of DNA fragments containing CAT, hANP and the 
proteolytic cleavage site, and a linker sequence into 
plasmid pTrp233. Plasmid phNF75 was digested with Eco RI 
and Hind i! I, the approximately 80 bp Eco RI- Hin dlll frag- 
ment containing hANP was purified by polyacrylamide gel 
20 electrophoresis, and ligated with EcoRI- and Hindi II- 

digested pTrp233 using T4 DNA ligase. From E. coli MC1061 
ampicillin-resistant trans formants , plasmid phNF87 was 
isolated and digested with Bam HI and the fragments were 
dephosphorylated using bacterial alkaline phosphatase. A 
25 Bam HI cassette containing the trp promoter-operator, 

ribosomal binding site, and large amino terminal fragment 
of the CAT gene was generated by digesting pCAT21 with 
Sea l , attaching BamH I synthetic linkers ( 5 ' -CGGATCCG-3 ' ) 
to the blunt termini using T4 DNA ligase, digesting the 
30 ligation with Bam HI and purification of the approximately 
740 bp BamH I fragment by agarose gel electrophoresis. The 
Bam HI cassette and plasmid phNF87 were ligated using T4 
ligase and ampicillin-resistant transformants of E. coli 
MC161 obtained. Plasmid pChNF109 (Fig. 2F), with the 
35 BamH I cassette in the orientation such that the CAT gene 
is fused in-frame to the endoproteinase Glu-C cleavage 



WO 90/01540 



-16- 



PCT/US89/03417 



site followed by the hANP gene, was selected on the basis 
of DNA fragment size in an EcoR I digest of the plasmid. 

2 . Expression of CAT (1-2 10) -hANP (102-126) 
5 Hybrid Protein From Plasmid pChNF109 . 

Plasmid pChNF109 expresses a CAT-hANP( 102-126) 
hybrid protein under the control of the E. coli trp ' 
promoter-operator. The plasmid was used to transform E. 
coli W3110 (ATCC Accession No. 27325) to ampicillin 

10 resistance and one colony was grown in culture overnight 
at 37°C in complete M9 medium containing M9 salts, 2 mM 
MgS04, 0.1 mM CaCl 2 , 0.4% glucose, 0.5% casamino acids, 40 
ug/ml tryptophan, 2 ug/ml thiamine hydrochloride, and 100 
ug/ml ampicillin sulfate. The overnight culture was 

15 diluted 100-fold into the same M9 medium described above 
(uninduced culture) and into M9 medium in which the 
tryptophan had been replaced by 25 ug/ml of 3-beta- 
indoleacrylic acid (induced culture). 

Expression was assessed after shaking the 

20 cultures for 6 hr at 37°C. The uninduced culture had 
reached a high cell density (stationary phase) and the 
induced culture was still at a low cell density 
(exponential phase). Phase-contrast microscopy revealed 
cells of normal morphology in the uninduced culture and 

25 elongated cells containing several refractile inclusion 
bodies in the induced culture. Total cell protein samples 
were prepared by boiling cell pellets in Laemmli buffer 
for 5 min and were analyzed by electrophoresis through a 
12% SDS-polyacrylamide gel followed by staining of the 

30 protein with Coomassie Blue. 

B. Expression Vector pChNF12l . 

Expression vector pChNF121 encodes a 99 amino 
acid CAT-hANP hybrid protein containing an endoproteinase 
35. Glu-C proteolytic cleavage site (Fig. 4A) . Approximately 
one-third of the CAT gene (amino acids 1-73) has been 
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fused to the hANP( 102-126 ) gene and proteolytic cleavage 
site (26 amino acids) without an intervening linker. The 
hANP polypeptide comprises 25% of the hybrid protein. 
This vector was constructed from plasmids pChNF109 and 
5 phNF87 which supplied the amino terminal fragment of the 
CAT gene and the hANP gene and proteolytic cleavage site, 
respectively. 

1. Construction of pChNF121 . 

10 Plasmid phNF87 was digested with EcoRI, its 

termini dephosphorylated with bacterial alkaline 
phosphatase, and ligated with an approximately 320 bp 
Eco RI fragment containing the trp promoter- opera tor, 
ribosome binding site, and amino-terminus of the CAT gene. 

15 This Eco RI cassette was purified from an EcoR I digest of 
PChNFl09 using agarose gel electrophoresis. Plasmid 
pChNF121 (Fig. 2G) was isolated from the arapicillin- 
resistant transformants of E. coli MC1061. On the basis 
of the size of the DNA fragments from a PvuII digest of 

20 the plasmid, the CAT and hANP genes were inferred to be 
fused in-frame to produce a hybrid protein. 

2. Expression of CAT( 1-73) -hANP ( 102-126) Hybrid 
Protein From Plasmid pChNF121 . 

25 Plasmid pChNF121 expresses a CAT-hANP< 102-126 ) 

hybrid protein under the control of the E. coli trp 
promoter-operator. The plasmid was used to transform E. 
coli W3110 (prototroph, TrpR+) to ampicillin resistance 
and one colony was grown in culture overnight at 37°C in 

30 complete M9 medium (see Section A.2.). The overnight 
culture was diluted 100-fold into complete M9 medium 
(uninduced culture) and into M9 medium with 25 ug/ml 3- 
beta-indole-acrylic acid replacing the 40 ug/ml tryp- 
tophan (induced culture). 

35 Expression was assessed after shaking the 

cultures for 6 hr at 37°C. The uninduced culture had 
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reached a high cell density whereas the induced culture 
reached about one-third this density. Phase contrast 
microscopy revealed cells of normal morphology in the 
uninduced culture and elongated cells with several 
5 refractile inclusion bodies in the induced culture. Total 
cell protein samples were prepared by boiling cell pellets 
in Laenimli buffer for 5 min. and were analyzed by 
electrophoresis through a 12% SDS-polyacrylamide gel fol- 
lowed by staining of the protein with Coomassie Blue. 

10 

C. Expression Vector pChNF142 . 
Expression vector pChNF142 encodes a 99 amino 
acid CAT-hANP hybrid protein containing a unique Trp 
residue following amino acid residue 73 of the CAT 

15 protein, as a site for chemical cleavage. Approximately 
one-third of the CAT gene (amino acids 1-73) has been 
fused to the hANP( 102-126} gene and chemical cleavage site 
(26 amino acids). This amino terminal fragment of CAT has 
been modified to substitute a Tyr residue for Trp [16] and 

20 a Ser residue for Cys[31] to remove the additional 

chemical cleavage site and reduce the multimerization of 
the hybrid protein through disulfide bridges. A synthetic 
hANP gene preceded by sequence encoding a Trp residue has 
been assembled for this vector. 

25 

1. Construction of pChNF142 . 
Plasmid pTrp233 was digested with EcoR I, its 
termini filled in with E. coli DNA polymerase I, Klenow 
fragment, and ligated with T4 DNA ligase (to remove the 

30 EcoR I restriction endonuclease cleavage site) . From 
ampicillin-resistant transformants of E. coli MC1061, 
plasmid pTrp81 was isolated and shown to resist cleavage 
by EcoR I . Plasmid pTrp8T was digested with Hde l and 
Hind lll, purified by agarose gel electrophoresis, and 

35 ligated with a synthetic CAT gene fragment using T4 DNA 
ligase. The synthetic Ndel-Hindlll CAT gene fragment 
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(Fig. 2H) was assembled from three pairs of oligo- 
deoxyribonucleo tides as previously described. From 
ampicillin-resistant transformants of E. coli MC1061, 
plasmid pCAT127 was isolated and shown to contain the 
5 synthetic CAT fragment by digestion with Eco RI and Aval. 
The plasmid was digested with BamH I and Hin dlll, the 
Bam HI- Hin dlll fragment containing CAT was purified by 
agarose gel electrophoresis, seguenced by the method of 
Sanger et al (1977), supra , and the correct DNA sequence 
10 confirmed. 

Plasmid pCAT127 was digested with EcoRI and - 
Hin di I I and ligated using T4 DNA ligase with a pair of 
annealed synthetic oligodeoxyribonucleo tides encoding 
hANP( 102-126) preceded by a Trp residue on an Eco RI - 

15 Hin di I I DNA fragment. Plasmid pChNF142 (Fig. 21) was 
isolated from ampicillin-resistant transformants of 
E. coli MC1061. Insertion of the hANP gene was confirmed 
by the size of the DNA fragments in a BamH I and Hin di I I 
digest of the plasmid. The sequence of the hANP gene was 

20 confirmed from an Eco RI- Sca l agarose gel purified fragment 
from pChNF142. 

2. Expression of CAT(l-73), Tyrfl61 Serf 311- 
hANPf 102-126) pChNF142 . 
25 The expression of a modified CAT-hANP( 102-126) 

hybrid protein is conducted in substantial accordance with 
the teaching of the previous examples A. 2 and B.2. 



II. Expression of Chloramphenicol Acetyl transferase — 
30 Amyloid A4 Protein Insert (A4-7511) Hybrid Proteins 

in Escherichia coli . 

In the following examples high level expression 
of the 57 amino acid insert within the amyloid A4-751 
protein was achieved by fusing a synthetic A4-7511 gene to 
35 DNA sequences encoding amino terminal fragments of CAT 
under the control of the E. coli tryptophan promoter- 



WO 90/01540 



PCT/US89/03417 



-20- 

operator on a pBR322-derived plasmid. The synthetic A4- 
751i gene encodes amino acids 289-345 from amyloid A4-751 
protein (Ponte et al (1988), Nature 331 ! 525-527) preceded 
by a chemical cleavage site, Asn-Gly. Hydroxylamine 
5 cleavage of the hybrid protein between these two residues 
will yield the insert protein with a Gly residue at its 
amino terminus. 

A. Expression Vector pCAPi!32 . 

!0 Expression vector pCAPil32 encodes a 132 amino 

acid CAT-A4751i hybrid protein containing a hydroxylamine 
cleavage site (Fig. 4A) . Approximately the amino terminal 
third of the CAT gene (amino acids 1-73) has been joined 
in-frame to the A4-7511 gene and cleavage site (59 amino 

15 acids). The A4-751i protein comprises about 43% of the 
hybrid protein. This vector was constructed from plasraids 
pTrp233 and pChNF121 and the synthetic A4-751i gene and 
cleavage site. 

20 l. Construction of pCAPi!32 . 

Plasmid pTrp233 was digested with EcoR I and 
Hind i I I r purified by agarose gel electrophoresis, and 
ligated with the synthetic gene encoding the A4-751i 
protein and cleavage site using T4 DBA ligase. The gene 

25 had been assembled from six oligodeoxyribonucleotides 
using previously described techniques and its sequence 
(Fig. 4A) confirmed. Plasmid pAPil31 was isolated from 
ampicillin- resistant transformants of E. coli MC1061. 
Insertion of the synthetic gene was confirmed by the size 

30 of the DNA fragments from a Pvu l and BamH I digest of 
plasmid mini -prep DNA. 

Plasmid pAPil31 was digested with EcoR I to 
linearize the vector and its termini dephosphorylated 
using bacterial alkaline phosphatase. Plasmid pChNF121 

35. was digested with EcoR I and the approximately 320 bp Eco RI 
fragment containing the trp promoter-operator, ribosome 
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binding site, and amino terminus of the CAT gene (amino 
acids 1-73) was purified by agarose gel electrophoresis. 
This Eco RI cassette was ligated with the pAPil31 plasmid 
using T4 DNA ligase and ampicillin-resistant transf ormants 
5 of MC1061 were obtained. On the basis of DNA fragment 
size in a PyuII digest of mini-prep plasmid DNA, plasmid 
pCAPil32 was isolated with an in- frame fusion of CAT and 
A4-751i sequences. 

10 2. Expression of CATf l-73)-A4-751i Hybrid 

Protein From Plasmid pCAPi!32 . 
Plasmid pCAPil32 expresses a CAT-A4-75H hybrid 
protein under the control of the E. coli trp promoter- 
operator. The plasmid was used to transform E. coli W3110 

15 to ampicillin resistance and one colony was grown in 
culture overnight at 37°C in complete M9 medium. The 
overnight culture was diluted 100-fold into complete M9 
medium which contains 40 ug/ml tryptophan (uhinduced 
culture) and into complete M9 medium containing 25 ug/ml 

20 3-beta-indoleacrylic acid instead of tryptophan (induced 
culture) . 

Expression was assessed after shaking the 
cultures for 6 hr at 37°C. The uninduced culture had 
reached a high cell density whereas the induced culture 

25 was at a lower cell density. Phase contrast microscopy 
revealed cells of normal morphology in the uninduced 
culture and cells with "pre-inclusion bodies" in the 
induced culture. As used herein, "pre-inclusion bodies" 
are defined as less refractile bodies which appear to 

30 convert in time to the more refractile "inclusion bodies" 
as the hybrid protein accumulates in the cells. Total 
cell protein samples were prepared by boiling cell pellets 
in Laemmli buffer for 5 min and then analyzed by 
electrophoresis through a 12% SDS-polyacrylamide gel fol- 

35 lowed by staining with Coomassie Blue (Fig. 3A) . This 
CAT( l-73)-A4-751i hybrid protein migrates between the 
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lysozyme (14,300 MW) and beta-lactoglobulin (18,400 MW) 

protein standards on this gel. Using a Kontes fiber optic 

scanner and Hewlett-Packard Integrator to scan the gel, 

the hybrid protein was estimated to comprise about 7% of 

5 the total cell protein. This is a moderate expression 

level of E. coli but A4-751i comprises almost half of the 

hybrid protein. 

To confirm the presence of A4-7511 in the hybrid 

protein, Western blot analysis was carried out on an 

10 unstained 12% SDS-polyacrylamide gel of these protein 

samples. Protein was blotted to nitrocellulose and 

incubated with anti-A4-751i serum (prepared against a 16 

amino acid synthetic peptide containing amino acids 11-26 

of the 57 amino acid insert protein). After . incubation 
125 

15 with 1-protein A (Araersham) the blot was placed on X- 
ray film at -70°C for several days. The synthetic peptide 
anti-serum detected the hybrid protein as well as several 
other E. coli proteins. 

20 B. Expression Vector pCAPi!36 . 

Expression vector pCAPil36 encodes a 274 amino 
acid CAT-A4-751i hybrid protein containing a hydroxylamine 
cleavage site. Most of the CAT gene (amino acids 1-210) 
has been joined in- frame to the A4-751i gene and cleavage 

25 site (59 amino acids) through a linker sequence (5 amino 
acids). The A4-751i polypeptide comprises about 21% of 
the hybrid protein. This vector was constructed from 
plasmids pAPil31 and pChNF109. 

30 1. Construction of pCAP1136 . 

Plasmid pAPil3l was digested with EcoR I to 
linearize the vector and its termini dephosphorylated 
using bacterial alkaline phosphatase. From a partial 
Eco RI digest of pChNF109 an approximately 740 bp Eco RI 

35 fragment containing the trp promoter-operator, the CAT 

gene (amino acids 1-210), and linker sequence was purified 
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by agarose gel electrophoresis. This Eco RI cassette and 
vector pAPi!31 were ligated using T4 DNA ligase and 
ampicillin-resistant transformants of E. coli MC1061 were 
isolated. From the size of DNA fragments in plasmid mini- 
5 preps digested with Bam HI, plasmid pCAPil36 was isolated 
with the CAT gene and the synthetic A4-75U gene in-frame. 

2. Expression of CATf 1-210 )-A4-751i Hybrid 
Protein From Plasmid pCAPi!36 . 
10 Plasmid pCAPil36 expresses a CAT-A4-7511 hybrid 

protein under the control of the E. coli trp promoter- 
operator. The plasmid was used to transform E . coli W3110 
to ampicillin resistance and one colony was grown in 
culture overnight at 37°C in complete M9 medium. The 
15 overnight culture was diluted 100-fold into the same M9 
medium (uninduced culture) and into. M9 complete medium 
containing 25 ug/ml 3-beta-indoleacrylic acid instead of 
tryptophan (induced culture). 

Expression was assessed after shaking the 

20 cultures for 6 hr at 37°C. Both the uninduced and induced 
cultures reached high cell densities. Phase contrast 
microscopy revealed cells of normal morphology in the 
uninduced cultures and cells containing inclusion bodies 
or pre- inclusion bodies (50:50) in the induced cultures. 

25 Total cell protein samples were prepared by boiling cell 
pellets in Laemmli buffer for 5 min and were analyzed by 
electrophoresis through a 12% SDS-polyacrylamide gel fol- 
lowed by staining with Cooraassie Blue (Fig. 3A) . This 
CAT-A4-751i hybrid protein migrates between the alpha- 

30 chymotrypsinogen (25,700 MW) and ovalbumin (43,000 MW) 

protein standards on this gel. Using a Kontes fiber optic 
scanner and Hewlett-Packard Integrator to scan the gel, 
the hybrid protein was estimated to comprises about 15% of 
total cell protein. This is moderately high level expres- 

35 sion for E. coli. 
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To confirm the presence of A4-7511 in the hybrid 
protein, Western blot analysis was carried out on an 
unstained 12% SDS-polyacrylaraide gel of these protein 
samples. Using the method described above (section II. 
5 A. 2 . ) , the synthetic peptide anti-serum detected the 

hybrid protein as well as several other E. coli proteins . 

Ill . Expression of Chloramphenicol Acetyl transferase — 
Glucaqon-Like Peptide I (7-37) Hybrid Protein in 

10 Escherichia coli . 

In the following example, high level expression 
of the 31 amino acid GLP-I (7-37) was achieved by fusing a 
synthetic 6LP-I gene to DNA sequences encoding an amino 
terminal fragment of CAT under the control of the E. coli 

15 tryptophan promoter-operator on a pBR322-derived plasmid. 
The synthetic gene encodes amino acids 7-37 of GLP-1 
(Mojsov et al (1987), J. Clin Invest 79 :616-619) preceded 
by a Met residue. Treatment with cyanogen bromide 
releases the insulinotropic peptide. 

20 

A. Expression Vector pCGLP139 . 
Expression vector pCGLP139 encodes a 105 amino 
acid CAT-GLP-I hybrid protein containing a cyanogen 
bromide cleavage site (Fig. 4B) . Approximately the amino 

25 terminal third of the CAT gene (amino acids 1-73) has been 
joined in- frame to the GLP-1 gene and cleavage site (32 
amino acids). The GLP-I peptide comprises about 30% of 
the hybrid protein. This vector was constructed from 
plasmids pTrp233 and pChNF109 and the synthetic GLP-I gene 

30 and cleavage site. 

1. Construction of PCGLP139 . 

Plasmid pTrp233 was digested with Eco RI and 
Hind i II, purified by agarose gel electrophoresis, and 
35 ligated with the synthetic gene using T4 DNA ligase. The 
gene had been assembled from four oligodeoxyribo- 
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nucleotides and its sequence (Fig. 4B) confirmed. From 
ampicillin-resistant trans formants of E. coli MC1061, 
plasmid pGLP138 was isolated. Insertion of the synthetic 
gene was confirmed by the failure of plasmid mini-prep DNA 
5 to be cut by Pst I . 



linearize the vector, its termini dephosphorylated using 
bacterial alkaline phosphatase, and ligated with the Eco RI 
cassette from plasmid pChNF109 using T4 DNA ligase. 

10 Plasmid pChNF109 had been digested with Eco RI and the ap- 
proximately 320 bp Eco RI fragment containing the trp 
promoter-operator, ribosome binding site, and an amino 
terminal fragment of the CAT gene purified by agarose gel 
electrophoresis. Plasmid pCGLP139 was isolated from 

15 ampicillin-resistant transformants of MC1061. On the 

basis of DNA fragment size in an Ava l and PvuII digest of 
plasmid mini-prep DNA, the fusion of CAT and GLP-I 
sequences was confirmed to be in- frame. 

20 2. Expression of CATf 1-73) -GLP-I (7-37) Hybrid 

Protein From Plasmid PCGLP139 . 
Plasmid pCGLP139 expresses a CAT-GLP-I hybrid 
protein under the control of the E . coli trp promoter- 
operator. The plasmid was used to transform E. coli W3110 

25 to ampicillin resistance and one colony was grown in 
culture overnight at 37°C in complete M9 medium. The 
overnight culture was diluted 100-fold into complete M9 
medium which contains 40 ug/ml tryptophan (uninduced 
culture) and into complete M9 medium in which 25 ug/ml 3- 

30 beta-indoleacrylic acid has been substituted for the 
tryptophan ( induced culture ) . 



reached a high cell density whereas the induced culture 
35 was at a lower cell density. Phase contrast microscopy 
revealed cells of normal morphology in the uninduced 



Plasmid pGLP138 was digested with Eco RI to 



Expression was assessed after shaking the 
cultures for 6 hr at 37°C. The uninduced culture had 
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culture and elongated cells with three or more refractile 
inclusion bodies in the induced culture. Total cell 
protein samples were prepared by boiling cell pellets in 
Laemmli buffer for 5 min and were analyzed by 
5 electrophoresis through a 12% SDS-polyacrylamide gel fol- 
lowed by staining with Coomassie Blue (Pig. 3B) . This 
CAT( 1-73 )-GLP-I( 7-37) hybrid protein migrates between the 
bovine trypsin inhibitor (6,200 MW) and lysozyme (14,300 
MW) protein standards. Using a Kontes fiber optic scanner 

10 and Hewlett-Packard Integrator to scan the gel, the hybrid 
protein was estimated to comprise about 20% of the total 
cell protein. (Considering the number of inclusion bodies 
observed per cell, all of the hybrid protein may not have 
been solubilized in the Laemmli buffer, and this estimate 

15 may be low. ) This is high level expression for E. coli . 

The molecular weight of the hybrid protein is as 
predicted for this gene fusion. Amino acid composition 
analysis of the purified hybrid protein or protein 
sequencing of the peptide after cyanogen bromide cleavage 

20 can be performed to confirm its expression. 

IV. CAT Fusion With Human SP-B and SP-C . 

The mature forms of both human SP-C and SP-B are 
expressed as fusions with portions of bacterial CAT. The 
25 surfactant peptides are joined to the carboxy terminus of 
the CAT sequences through a hydroxylamine-sensitive 
asparagine-glycine linkage. The CAT-surfactant fusions 
are expressed from the tryptophan promoter of the bacte- 
rial vector pTrp233. 

30 

A. Expression Vector PC210SP-B . 

SP-B expression vector pC210SP-B encodes a fu- 
sion protein of 293 residues in which 210 amino acids of 
CAT are joined to the 76 amino acids of SP-B through a 
35 linker of 7 amino acids containing the hydroxylamine- 
sensitive cleavage site. Cleavage of the fusion with 
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hydroxyl amine releases a 77 amino acid SP-B product 
containing the 76 residue mature form of SP-B, plus an 
amino-terminal glycine residue. 

To construct pC210SP-B, the short Eco RI- Hind lll 
5 segment containing ANF sequences was removed from 

pChNF109, and replaced by a portion of human SP-B cDNA #3 
extending from the Pst I site at nucleotide (nt) 643 (Fig. 
6) to the Sph I site at nt 804. The Eco RI site was joined 
at the Pst I site through two complementary 

10 oligonucleotides encoding the hydroxylamine sensitive 
cleavage site as well as the amino-terminal residues of 
mature SP-B (oligo #2307: 5'-AAT TCA ACG GTT TCC CCA TTC 
CTC TCC CCT ATT GCT GGC TCT GCA-3' and oligo #2308: 5'-GAC 
CCA GCA ATA GGG GAG AGG AAT GGG GAA ACC GTT G-3 ' ) - The 

15 Sph I site was joined to the Hind i I I site of PTrp233 

through a second set of complementary nucleotides encoding 
the carboxy- terminal residues of mature SP-B (oligo #3313: 
5 ' -AGC TTA CCG GAG GAC GAG GCG GCA GAC CAG CTG GGG CAG CAT 
G-3' and oligo #3314: 5' -CTG CCC CAG CTG GTC TGC CGC CTC 

20 GTC CTC CGG TA-3' ) . 

The expression plasmid was used to transform E. 
coli stain W3110 to ampicillin resistance. Rapidly grow- 
ing cultures of pC210SP-B/W3110 in M9 medium were made 25 
ug/ml IAA (3-beta indoleacrylate, Sigma 1-1625) to induce 

25 the Trp promoter. By 1 hr after induction, refractile 
cytoplasmic inclusion bodies were seen by phase contrast 
microscopy inside the still-growing cells. 5 hr after 
induction, the equivalent of 1 O.D. 55Q of cells were 
pelleted by centrifugation, then boiled for 5 min in SDS 

30 sample buffer for electrophoresis in a 12% SDS- 

polyacrylamide gel followed by staining with Coomassie 
Blue (Fig. 7). Lane A = molecular size standards; Lane B 
= induced W3110 cells containing pTrp233 vector control; 
and Lane C = induced pC210SP-B/W3110 . The predicted 

35 molecular weight of the CAT:SP-B fusion protein is 45,000 
daltons. The hybrid CAT: SP-B protein was estimated to 
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comprise 15-20% of the total cell protein in the induced 
cultures . 

B. CAT Fusions with SP-C . 
5 A series of vectors were constructed encoding 

fusion proteins in which mature human SP-C was fused to 
the carboxy termini of different portions of CAT through a 
hydroxylamine-sensitive asparagine-glycine linkage. 
Hydroxylamine cleavage of the fusion protein produced by 
10 each construct releases a mature SP-C of 35 amino acids 
which lacks the amino-terminal phenylalanine residue seen 
in a portion of natural human SP-C. 

1. PC210SP-C . 

15 The amino acid sequence of the 251 residue fu- 

sion protein encoded plasmid pC210SP-C. The 210 amino 
acids of CAT are joined to 35 amino acids of mature SP-C 
through a linker of 6 amino acids. The mature SP-C 
portion of the total fusion protein comprises 14%. 

20 In Fig. 8 is shown the nucleotide sequence of 

pC210SP-C, in which the EcoR I- Hin dlll fragment of pC210SP- 
B containing SP-B sequences has been replaced by a segment 
of human SP-C cDKA #18 extending from the ApaL I site at 
nucleotide 123 to the Ava il site at nucleotide 161. The 

25 EcoR l site of the CAT vector was joined to the SP5 ApaL I 
site through two complementary oligonucleotides encoding 
the hydroxylamine sensitive cleavage site as well as the 
amino-terminal residues of mature SP-C (oligo #2462: 5'- 
AAT TCA ACG GCA TTC CCT GCT GCC CAG-3' and oligo #2463: 

30 5 ' -TGC ACT GGG CAG CAG GGA ATG CCG TTG-3 ' ) . The Avail 
site of SP-C was joined to the Hindi I I site of pC210SP-B 
through a second set of complementary nucleotides encoding 
the carboxy- terminal residues of mature SP-C and a stop 
codon (oligo #2871: 5'-AGC TTA GTG GAG ACC CAT GAG CAG GGC 

35, TCC CAC AAT CAC CAC GAC GAT GAG- 3' and oligo #2B72: 5'-GTC 
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CTC ATC GTC GTG GTG ATT GTG GGA GCC CTG CTC ATG GGT CTC 
CAC TA-3 ' ) . 

2. PC179SP-C . 

5 The amino acid sequence of the 217 residue fu- 

sion protein encoded by pC179SP-C is a slight modification 
of the sequence shown in Fig. 8. In pC179SP-C, the 179 
amino acids of CAT are joined to 35 amino acids of mature 
SP-C through a linker of 3 amino acids (Glu, Phe, Asn) . 

10 SP-C portion of the total fusion protein comprises 16%. 

To construct pC179SP-C/ a portion of the CAT 
sequence was removed from pC210SP-C. Starting with 
pC210SP-C, a DNA fragment extending from the Nco l site at 
nt 603 (Fig. 8) to the Eco RI site at nt 728 was removed, 

15 and the Nco l and Eco RI cohesive ends were rejoined with 
two complementary oligonucleotides (oligo #3083: 5 '-CAT 
GGG CAA ATA TTA TAC GCA AG-3' and oligo #3084* 5'-AAT TCT 
TGC GTA TAA TAT TTG CC-3 ' ) . In effect, 31 residues of 
CAT, and 3 residues of the linker polypeptide are missing 

20 in the new fusion protein encoded by vector pC179SP-C. 

3. PC149SP-C . 

The amino acid sequence of the 187 residue fu- 
sion protein encoded by pC149SP-C is a slight modification 

25 of the sequence shown in Fig. 8. In plasmid pC149SP-C, 

the 149 amino acids of CAT are joined to 35 amino acids of 
mature SP-C through a linker of 3 amino acids (Glu, Phe, 
Asn) . The SP-C portion of the total fusion protein 
comprises 18.7%. 

30 To construct pC149SP-C, a portion of the CAT 

segment of pC210SP-C extending from the Dde l site at nt 
523 (Fig. 8) to the Eco RI site at nt 728 was removed and 
replaced by a set of two complementary oligonucleotides 
(oligo #3082: 5'-TCA GCC AAT CCC G-3' oligo #3081: 5'-AAT 

35 TCG GGA TTG GC-3' ) . 
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4. PC106SP-C . 

The amino acid sequence of the 144 residue fu- 
sion protein encoded by pC106SP-C is a slight modification 
of the sequence shown in Fig. 8. In plasmid pC106SP-C, 
5 the 106 amino acids of CAT are joined to 35 amino acids of 
mature SP-C through a linker of 3 amino acids (Glu, Phe, 
Asn) . The SP-C portion of the total fusion protein * 
comprises 24%. 

pC106SP-C was constructed by replacing the EcoRi 

10 fragment of pC210SP-C (nt 302 to nt 728, Fig. 8) with two 
sets of complementary oligos which were annealed, then 
ligated together through a region of homology (oligo 
#3079: 5'-AAT TCC GTA TGG CAA TGA AAG ACG GTG AGC TGG TGA 
TAT GGG ATA GTG TTC ACC CTT GT-3' was annealed with oligo 

15 #3085: 5'-ACA CTA TCC CAT ATC ACC AGC TCA CCG TCT TTC ATT 
GCC ATA CGG-3 ' ? oligo #3080: 5'-TAC ACC GTT TTC CAT GAG 
CAA ACT GAA ACG TTT TCA TCG CTC TGG G-3' was annealed with 
oligo #3078: 5'-AAT TCC CAG AGC GAT GAA AAC GTT TCA GTT 
TGC TCA TGG AAA ACG GTG TAA CAA GGG TGA- 3 ' ) . 

20 

5 . Expression From SP-C Vectors . 

Each SP-C expression vector was used to 
transform E. coli strain W3110 to ampicillin resistance. 
Rapidly growing cultures of expression strains were 

25 induced as described above. By 1 hr after induction, 

refractile cytoplasmic inclusion bodies were seen by phase 
contrast microscopy inside the still-growing cells. 5 hr 
after induction, the equivalent of 1 O.D.g^Q of cells were 
pelleted by centrifugation, then boiled for 5 min in SDS 

30 sample buffer for electrophoresis in a 12% SDS- 

polyacrylamide gel followed by staining with Coomassie 
Blue. The results are provided in Fig. 9 wherein Lane A = 
molecular size standards^ Lane B = induced W3110 cells 
containing pTrp233 vector control; Lane C = induced 

35 PC106SP-C; Lane D = pC149SP-C; Lane E - pC179SP-C? Lane F 
= pC210SP-C. The hybrid CAT: SP-C protein produced by each 
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vector is estimated to comprise 15-20% of the total cell 
protein in the induced cultures. 

V. Improved CAT Vectors for Expression of Hybrid Proteins 
5 in Escherichia Coli . 

In the following examples, the basic CAT gene 
fusion vector has been improved in several ways: (1) 
unique cloning sites are created for insertion of the gene 
to be expressed, (2) the CAT gene is modified to optimize 
10 cleavage and/or purification of the peptides, and (3) the 
gene conferring resistance to tetracycline is restored to 
provide an alternative method for plasmid selection and 
maintenance . 

15 A. Expression Vectors pCAT73 and pCAT210 . 

Expression vector pCAT73 contains genes confer- 
ring resistance to both ampicillin and tetracycline, 
unique Eco RI and Hin di I I cloning sites for insertion of 
genes to be expressed, and the amino terminal fragment (1- 

20 73) of the CAT gene. The cleavage site, included with the 
inserted gene, may not be unique. This plasmid is 
constructed from plasmids pBR322, pTrp233, pCAT21, and 
oligodeoxyribonucleotides . Expression vector pCAT210 dif- 
fers from pCAT73 in that it contains the larger amino 

25 terminal fragment (1-210) of the CAT gene from which the 
Eco RI site at the sequence encoding residues 72 and 73 
(Glu-Phe) has been removed. (An alternative codon choice 
preserves the Glu and permits the use of unique EcoR I and 
Hin di I I cloning sites.) Other DNA fragments encoding the 

30 amino terminus of the CAT gene, smaller than 73 amino 
acids or between 73 and 210 amino acids may also be 
constructed by insertion of an EcoR I site at the desired 
fusion point. 



35 
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1. Construction of pCAT73 . 

Restoration of the gene for tetracycline resist- 
ance requires restoring the BamH I -Hind lll- BcoR I fragment 
of pBR322 to the CAT expression vector. Since the unique 
5 cloning sites desired for this vector are EcoR I and 

Hind i! I, this must be done in a manner which removes these 
sites but retains resistance to tetracycline. Since- 
insertion of DNA at the Hin di I I site upstream of the cod- 
ing region often prevents gene expression, this site is 

10 removed by creating a point mutation at the Hin di I I site. 
Plasmid pBR322 f was digested with EcoR I and Hind i I I and 
the vector backbone gel purified. The backbone was 
ligated with synthetic EcoRI -Hindi I fragments, which are 
formed by annealing pairs of oligonucleotides using T4 DNA 

15 ligase. The fragments contain the normal EcoR I -Hind i! I 

sequence with the exception of point mutations (G or C) at 
the first adenine of the recognition sequence 5 ' -AAGCTT- 
3 ' . An intermediate plasmid was isolated from ampicillin- 
resistant and tetracycline-resistant E . coli MC1061 

20 transformants whose plasmid mini-prep DNA was not digested 
by Hindlll . 

A BamHI -EcoRI fragment no longer containing a 
Hind i I I site was purified from agarose gel electrophoresis 
from a BamH I and EcoR I digest of plasmid pTetHl. The 

25 fragment was ligated using T4 DNA ligase with plasmid 

pTrp233 which was also digested with BamH I and EcoR I and 
agarose gel purified. Transformed with the ligation, 
colonies of E. coli MC1061 were selected for ampicillin 
and/or tetracycline resistance. Plasmid pTrpT233 was 

30 resistant to both antibiotics. 

In an alternate embodiment, digestion of 
pTrpT233 with EcoR I, blunting of the termini with DNA 
polymerase I, Klenow fragment, and ligation with T4 DNA 
ligase will eliminate the EcoR I site (which does not 

35 affect resistance to tetracycline) . Tetracycline- 
resistant plasmid pTrpT234 which has lost undesirable 
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Hind i I I and Eco RI sites is isolated from colonies of E. 
coli MC1061 transformed with this ligation. 

The CAT gene is obtained as an Nde l- Hin dlll 
fragment purified by agarose gel electrophoresis of an 
5 Ndel-Hindlll digest of pCAT21. Plasmid pTrpdeltaHind was 
digested with Nde l and Hind i II, purified by agarose gel 
electrophoresis, and ligated with the CAT gene using T4 
DNA ligase. From ampicillin (or tetracycline) resistant 
trans formants of E. coli MC1061 digested with Eco RI and 
10 Hin di I I to verify incorporation of the CAT gene, plasmid 
pCAT73 (Fig. 5A) is isolated. 

2 . Construction of pCAT210 . 
The BamH I- Hin dlll fragment containing the trp 
promoter-operator, ribosome binding site, and CAT gene is 
purified by agarose gel electrophoresis from a BamH I and 
Hin di I I digest of plasmid pCAT21. Site specific 
mutagenesis is carried out on the fragment using M13 and 
mutagenic oligodeoxyribonucleotides to convert the GAA 
codon for Glu to GAG (also to Glu) within the EcoR I site, 
5'-GAATTC-3' . One such plasmid, M13-CATdR, is digested 
with Sea l to linearize the vector and ligated with an 
Eco RI linker (for the same reading frame as in pCAT73) 
using T4 DNA ligase. From the trans fectants, M13-CATR1, 
is isolated and digested with Nde l and Hindlll. The new 
CAT gene is purified by agarose gel electrophoresis and 
ligated using T4 DNA ligase with Nde l- Hin dlll-diqested 
plasmid pTrpT234. Plasmid pCAT210 (Fig. 5B) is isolated 
from ampicillin (or tetracycline) resistant trans formants 
of E. coli MC1061. 

B. Expression Vectors pCAT73-T and pCAT73-M . 
Expression vectors pCAT73-T and pCAT73-M are 
examples in which the amino acid sequence of CAT has been 
35 altered using site specific mutagenesis techniques to 

facilitate purification of the product protein. In these 
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cases, the Trp residue at position 16 may be substituted 
with Tyr and the Met residue at position 67 may be 
substituted by lie or Leu to eliminate potential chemical 
cleavage sites within CAT. In addition, the Cys at posi- 
5 tion 31 may also be substituted using a conservative amino 
acid alteration, that is, substitution with an amino acid 
which does not adversely affect biological activity.. 
Preferred residues include alanine, serine, leucine, 
isoleucine and valine, most preferred is serine. These 
10 latter alterations are intended to reduce multimerization 
through disulfide bridges. 

C. Expression of Modified CAT-GLP-1 

Plasmid pTrpdeltaHind contains the restored Tet R 

15 gene from pTrp233 (although the Hind i I I site has been 

eliminated) , the Trp 16 to Tyr, Cys 31 to Ser, and Met g7 to 
Leu substitutions in the CAT gene sequence, and the GLP-1 
gene (taught in Example III) fused in- frame to the 
modified CAT gene through a methione residue. The vector 

20 was used to transform several E. coli strains including 
W3110, MC1061, DH1, MM294 and RR1. 

E. coli RR1 trans formants were more stable and 
appeared to have better induction/repression control of 
the Trp promoter than any of the other hosts. An 

25 alternative construction for this vector includes 
reversing the Tet gene (to avoid the back-to-back 
placement of the Tet and Trp promoters in the present 
construct) to alleviate the stability problems observed 
using bacterial hosts other than RRI trans formants . 

30 

VI. Construction of pTrpCAT72 ;Adipsin/D . 

The coding sequence for mature human adipsin/D 
was fused to pCAT72 to produce a fusion protein suitable, 
for example, to generate antisera against human adipsin/D. 

35 
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A. Construction of pTrpCAT72 Q3S1 

Plasmid pCAT72 Q3S1 was constructed to eliminate 
Asn residues at which secondary cleavages can occur during 
hydroxylamine release of peptides fused to CAT. The Asn 
5 residues at amino acid positions 26, 51 and 78 of CAT were 
changed to Gin residues. At the same time, the single Cys 
at position 31 was changed to Ser to decrease the amount 
of aggregation seen with many CAT fusion proteins. 

The vector pCAT72 Q3S1 was constructed as fol- 

10 lows: Oligos CAT72-1 through 6 (below) were annealed and 
ligated into pUC-9 which had been cleaved with Ndel and 
Eco RI . In this way, the mutated CAT72 was joined to the 
polylinker region of the pUC plasmid. CAT72 Q3S1 with the 
polylinker was then removed from pUC by cleavage with Nde l 

15 and Hind i II, and inserted into pTrp233 between Nde l and 
Hindi I I to yield pTrpCAT72 Q3S1. 



20 



25 



CAT72-1 

10 20 30 40 50 

TATGGAGAAA AAAATCACTG GATATACCAC CGTTGATATA TCCCAATGGC 

60 70 
ATCGTAAAGA ACATTTTGAG GCATTTCA 

CAT72-2 

10 20 30 40 50 

CAAAATGTTC TTTACGATGC CATTGGGATA TATCAACGGT GGTATATCCA 

60 

TGATTTTTT TCTCCA 



CAT72-3 

10 20 30 40 50 

TCAGTTGCT CAATCTACCT ATCAGCAGAC CGTTCAGCTG GATATTACGG 

30 60 70 80 

CCTTTTTAAA GACCGTAAAG AAACAGAAGC 



CAT72-4 

10 20 30 40 50 

CTTTACGGTC TTTAAAAAGG CCGTAATATC CAGCTGAACG GTCTGCTGAT 

60 70 80 

AGGTAGATTG AGCAACTGAC TGAAATGCCT 



• 
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CAT72-5 

10 20 30 40 50 

ACAAGTTTTA TCCGGCCTTT ATTCACATTC TTGCCCGCCT GATGCAGGCT 

CATCCGG 
CAT72-6 

10 20 30 40 50 

AATTCCGGAT GAGCCTGCAT CAGGCGGGCA AGAATGTGAA TAAAGGCCGG 

60 70 
ATAAAACTTG TGCTTCTGTT T 



B. Construction of pTrpCAT72 Q6S3 

Starting with pCAT72 Q3S1, pCAT153 Q6S3 was 
constructed to change the Asn residues at positions 130, 
141 and 148 of CAT to Gin residues, and to change the Cys 
residues at 91 and 126 to Ser residues. 

Plasntid CAT72 Q3S1 in pUC-9 was cleaved with 
EcoR l . Oligos CAT153-1 through 6 (below) were annealed 
and ligated into pCAT72 to give pCAT153 Q6S3. The 
modified pCAT153 was then removed from pUC by cleavage 
with Kdel and Hind lll , and the resulting fragment inserted 
into pTrp233 to give pTrpCAT153 Q6S3. 



CAT153-1 

10 20 30 40 50 

AATTTCGTAT GGCAATGAAA GACGGTGAGC TGGTGATATG GGATAGTGTT 

60 70 80 

CACCCTTCTT ACACCGTTTT CCATGAGCAA 



CAT153-2 

10 20 30 40 50 

AAAACGGTGT AAGAAGGGTG AACACTATCC CATATCACCA GCTCACCGTC 

60 

TTTCATTGCC ATACGA 



CAT153-3 

10 20 30 40 50 

ACTGAAACGT TTTCATCGCT CTGGAGTGAA TACCACGACG ATTTCCGGCA 

60 70 80 

GTTTCTACAC ATATATTCGC AAGATGTGGC 
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CAT133-4 

10 20 30 40 50 

GCGAATATAT GTGTAGAAAC TGCCGGAAAT CGTCGTGGTA TTCACTCCAG 

60 70 80 

AGCGATGAAA ACGTTTCAGT TTGCTCATGG 

5 

CAT153-5 

10 20 30 40 50 

GTCTTACGGT GAACAGCTGG CCTATTTCCC TAAAGGGTTT ATTGAGCAGA 

60 70 
TGTTTTTCGT CTCAGCCCAG CCCG 

10 CAT153-6 

10 20 30 40 50 

AATTCGGGCT GGGCTGAGAC GAAAAACATC TGCTCAATAA ACCCTTTAGG 

60 70 80 

GAAATAGGCC AGCTGTTCAC CGTAAGACGC CACATCTT 

15 

Next, the human adipsin/D. cDNA hg31-40 (Figure 
10) was constructed. The BamH I - Sty l fragment containing 
the mature coding region was gel purified and inserted 
into pUC-9 which had been cleaved with BamHI and Hin di II . 

20 The Sty l end of the cDNA was joined to the Hind i I I end of 
pUC using two oligos (#3886 5 ' -CATGGGTGCCGGGGCCTGA-3 ' and 
#3887 5 ' -AGCTTCAGGCCCCGGCACC-3 ' ) . By inserting the BamHI - 
Sty l fragment of adipsin/D into pUC in this way, the cod- 
ing sequence of adipsin/D was placed in frame with the 

25 Eco RI site of pUC-9. The Eco RI - Hin di II fragment of this 
construct was removed from pUC-9 and inserted into 
pTrpCAT72 between the EcoR I site and the Hind i I I sites to 
yield pTrpCAT7 2 : Adipsin/D. 

This construct gave 10-15% levels of fusion 

30 protein upon induction in W3110 cells. 

Modifications of the above described modes for 
carrying out the invention that are obvious to those of 
skill in the art of molecular biology, protein, chemistry, 
35 cell biology, or related fields are intended to be within 
the scope of the following claims. 
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The embodiments of the invention in which an 
exclusive property or privilege is claimed are defined as 
follows : 



5 1. A method of stabilizing heterologous protein 

expression in a prokaryotic host comprising: 

(a) constructing a hybrid gene comprising in 
sequential order, a 3 r truncated chloramphenicol 
acetyltransf erase (CAT) gene sequence fused in frame with 

10 a heterologous gene sequence encoding a mammalian 

polypeptide selected from the group consisting of amyloid 
protein A4-751 insert sequence, glucagon-like peptide I, 
adipsin/D, lung surfactant protein SP-B and lung 
surfactant protein SP-C, wherein said polypeptide is 

15 normally not recoverable in bacterial expression systems, 
and wherein said hybrid gene, upon translation, produces a 
fusion protein in a recoverable yield; 

(b) providing a vector for expression of said 
hybrid gene; 

20 (c) culturing the prokaryotic host transformed 

with the expression vector; and 

(d) recovering the fusion protein. 

2. The method of claim 1 wherein said 
25 prokaryotic host is a bacterial cell. 

3. The method of claim 2 wherein said bacterial 
cell is E. coli. 



30 4, The method of claim 1 wherein said 3' 

truncated CAT gene sequence enhances the level of 
heterologous protein present in the total cellular 
protein . 



35 
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5. The method of claim 1 wherein the length of 
the truncated CAT gene sequence encodes a CAT peptide of 
about 73 to about 210 amino acids. 

5 6. The method of claims 1 or 5 wherein said 

hybrid gene further comprises a DMA sequence encoding a 
selective cleavage site located between the CAT gene 
sequence and the heterologous gene sequence. 

10 7. The method of claim 6 wherein said selective 

cleavage site is composed of tryptophan, methionine, • 
asparagine-glycine, or glutamic acid. 

8. A method of stabilizing heterologous protein 
15 expression in a prokaryotic host comprising: 

(a) constructing a hybrid gene comprising in 
sequential order, a 3' truncated chloramphenicol 
acetyl transferase (CAT) gene sequence encoding a CAT 
peptide of about 73 to about 180 amino acids, fused in- 

20 frame with a heterologous gene sequence encoding a 

mammalian polypeptide selected from the group consisting 
of amyloid protein A4-751 insert sequence, glucagon-like 
peptide I, adipsin/D, lung surfactant protein SP-B and 
lung surfactant protein SP-C, wherein said heterologous 

25 protein is normally not recoverable in bacterial 

expression systems, and wherein said hybrid gene, upon 
translation, produces a fusion protein in a recoverable 
yield; 

(b) providing a vector for expression of said 
30 hybrid gene; 

(c) culturing the prokaryotic host transformed 
with the expression vector; and 

(d) recovering the fusion protein. 

35 9 . The method of claim 8 wherein said hybrid 

gene further comprises a DNA sequence encoding a selective 
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cleavage site located between the CAT gene sequence and 
the heterologous gene sequence. 

10. A bacterial expression vector capable of 

5 enhancing the level of expression of non-stable, bacteri- 
ally produced heterologous polypeptides comprising: 

a hybrid gene having in sequential order, a 3' 
truncated CAT gene sequence linked to a heterologous gene 
sequence encoding a mammalian polypeptide selected from 

10 the group consisting of amyloid protein A4-751 insert 
sequence, glucagon-like peptide I, adipsin/D, lung 
surfactant protein SP-B and lung surfactant protein SP-C, 
wherein said polypeptide is normally not recoverable in 
bacterial expression systems, whereby said truncated CAT 

15 gene sequence is capable of rendering the resulting fusion 
protein resistant to proteolytic degradation. 

11. The method of claim 10 wherein the length 
of the truncated CAT gene sequence encodes a CAT peptide 

20 °f about 73 to about 210 amino acids. 

12. The bacterial expression vector of claims 
10 or 11 wherein said hybrid gene further comprises a DNA 
sequence encoding a selective cleavage site located 

25 between the CAT gene sequence and the heterologous gene 
sequence . 

13. The vector of claim 12 wherein the hybrid 
gene having said 3' truncated CAT gene sequence, upon 

30 expression, enhances the level of the heterologous protein 
present in the total cellular protein. 

14. In a bacterial expression vector capable of 
enhancing the level of expression of non-stable, bacteri- 

35 all Y Produced heterologous polypeptides wherein the vector 
comprises a hybrid gene having in sequential order, a 3' 



# 
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truncated CAT gene sequence linked to a heterologous gene 
sequence encoding a polypeptide normally not recoverable 
in bacterial expression systems, said truncated CAT gene 
sequence being capable of rendering the resulting fusion 
5 protein resistant to proteolytic degradation, the 

improvement comprising altering one or more DNA codons of 
the truncated CAT gene to eliminate potential chemical 
cleavage sites within the CAT protein. 

10 15. The improved bacterial expression vector of 

claim 32 wherein the alterations include substituting- the 
DNA encoding a) methionine at position 67 of CAT with DNA 
encoding isoleucine or leucine; (b) cysteine at position 
31 of CAT with DNA encoding serine; or (c) tryptophan at 

15 position 16 of CAT with DNA encoding tyrosine. 



20 



25 



30 



35 
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